Abstract

The optimal rate of convergence of estimators of the integrated volatility, for a
discontinuous Itô semimartingale sampled at regularly spaced times and over a fixed time
interval, has been a long-standing problem, at least when the jumps are not summable. In
this paper we study this optimal rate, in the minimax sense and for appropriate "bounded"
non-parametric classes of semimartingales. We show that, if the $r$th powers of the jumps
are summable for some $r \in [0, 2)$, the minimax rate is equal to
$\min(\sqrt{n}, (n \log n)^{(2-r)/2})$, where $n$ is the number of observations.



1 Introduction 



Let $X$ be a one-dimensional Itô semimartingale, which in particular means that its
"continuous martingale part" has the form

$X^c_t = \int_0^t \sigma_s \, dW_s,$

where $W$ is a standard Brownian motion, and the process $\sigma_t$ is optional and
(locally) square-integrable.

One of the long-standing problems is the estimation of the so-called integrated volatility,
say at time 1, that is, of the variable $C_1 = \int_0^1 c_s \, ds$, where $c_t = \sigma_t^2$ is
the (squared) volatility, on the basis of discrete observations of $X$. A huge number of papers
have been devoted to this question already, in various situations: when the process is
continuous (so $X$ is the sum of $X^c$ above, plus possibly a drift term), or when it has jumps;
when the process $X$ is "perfectly" observed, or contaminated by noise; when the sampling times
are equi-spaced, or when they are irregularly spaced.

Below, we consider the simplest case, where the sampling is done at the regularly spaced
times $i/n$ for $i = 0, \ldots, n$, and when $X_{i/n}$ is observed without noise. Even in this
simple situation, the question of the "optimal" rate of convergence of estimators towards
$C_1$, as $n \to \infty$, is unanswered so far, when there are jumps which are "too active".

More precisely, estimators are known which converge to $C_1$ with the rate $\sqrt{n}$ in the
continuous case (the realized volatility, or "approximate quadratic variation" at time 1,
achieves this rate), and also when $X$ has jumps with a degree of activity, or
Blumenthal-Getoor index, less than 1. This rate is optimal (in a minimax sense), for the
following reason: if $X = \sigma W$ where $c = \sigma^2$ is a constant, so $C_1 = c$, we have a
purely parametric setting for which the local asymptotic normality (LAN) holds with rate
$\sqrt{n}$, and the realized volatility is indeed the MLE in this case.

However, when the degree $r$ of jump activity is larger than 1, the best rates found in
the literature are of the form $n^{\frac{2-r}{2} - \varepsilon}$ for $\varepsilon > 0$
arbitrarily small (see below for more details). The difficulty comes of course from the
essentially non-parametric feature of the problem, since we do not want to specify the law of
the process $X$, apart from the fact that it is an Itô semimartingale, plus possibly some
boundedness assumptions on its characteristics. In a purely parametric problem, for example
when $X$ is a Lévy process with a known Lévy measure and the only unknown parameters are the
variance $c$ of the Gaussian part, and possibly the drift, then again the rate $\sqrt{n}$ is
available for estimating $c$ (this rate is achieved by the MLE, under very general
circumstances).

The aim of this paper is to determine a bound for the minimax rate, when the degree of
activity is $r$ or smaller (the precise definition of $r$ is given in Assumption (L-r) below,
and is slightly different from the usual Blumenthal-Getoor index). We will see that the best
possible rate is $(n \log n)^{\frac{2-r}{2}}$ when $r > 1$ (and of course $\sqrt{n}$ when
$r \le 1$). It is interesting to notice that the truncated realized volatility, which achieves
the rate $n^{\frac{2-r}{2} - \varepsilon}$ for any pre-specified $\varepsilon > 0$, is indeed
"almost" rate-optimal.

The paper is organized as follows: in Section 2 we state the assumptions and review some
known results. The results of this paper are presented in Section 3 and the proofs are given
in the last section.



2 Some known results 

We consider a one-dimensional Itô semimartingale $X$ on a filtered space
$(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, \mathbb{P})$, which is observed at regularly
spaced times $i/n$ for $i = 0, 1, \ldots, n$, over the (fixed) finite interval $[0, 1]$. The
characteristics $(B, C, \nu)$ of $X$ thus have the form

(1) $B_t = \int_0^t b_s \, ds, \quad C_t = \int_0^t c_s \, ds, \quad \nu(dt, dx) = dt \, F_t(dx).$

Here, $b_t$ and $c_t$ are optional (or, predictable) processes, with $c_t \ge 0$, and
$F_t = F_{\omega,t}(dx)$ is an optional random measure, also called the Lévy measure, which
accounts for the jumps of the process.

When $X$ is continuous, the best thing one can do for estimating $C_1$ is to use the realized
volatility, or approximate quadratic variation at time 1:

(2) $[X, X]^n_1 = \sum_{i=1}^n (\Delta^n_i X)^2, \quad \text{where } \Delta^n_i X = X_{i/n} - X_{(i-1)/n},$

which converges in probability to $C_1$. When further $\int_0^1 b_s^2 \, ds$ and
$\int_0^1 c_s^2 \, ds$ are a.s. finite, we have the stable convergence in law

(3) $\sqrt{n} \, ([X, X]^n_1 - C_1) \xrightarrow{\mathcal{L}\text{-}s} U, \quad \text{where } U = \sqrt{2} \int_0^1 c_s \, dW'_s,$

and where $W'$ is a standard Brownian motion, defined on an extension of
$(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, \mathbb{P})$, and which is independent of the
$\sigma$-field $\mathcal{F}$.
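
As an illustration of (2) and of the $\sqrt{n}$ scaling in (3), here is a minimal sketch in plain Python. It assumes the simplest continuous model $X = \sqrt{c}\,W$ with constant $c$; the helper names `realized_volatility` and `simulate_brownian_path` are hypothetical and only serve this example.

```python
import math
import random


def realized_volatility(path):
    """Sum of squared increments of a path observed at times i/n, as in (2)."""
    return sum((b - a) ** 2 for a, b in zip(path, path[1:]))


def simulate_brownian_path(n, c, seed=0):
    """Illustrative assumption: X = sqrt(c) * W sampled at i/n for i = 0..n."""
    rng = random.Random(seed)
    step_sd = math.sqrt(c / n)  # each increment is N(0, c/n)
    path = [0.0]
    for _ in range(n):
        path.append(path[-1] + rng.gauss(0.0, step_sd))
    return path


def scaled_error(n, c, seed=0):
    """sqrt(n) * ([X,X]^n_1 - C_1) for the constant-volatility model, where C_1 = c."""
    path = simulate_brownian_path(n, c, seed)
    return math.sqrt(n) * (realized_volatility(path) - c)
```

In this constant-volatility model, `scaled_error(n, c)` should stay of order $\sqrt{2}\,c$ as `n` grows, which is what the limit in (3) predicts.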

When $X$ has jumps, the variables $[X, X]^n_1$ no longer converge to $C_1$, but to the "full"
quadratic variation $[X, X]_1 = C_1 + \sum_{s \le 1} (\Delta X_s)^2$, where
$\Delta X_s = X_s - X_{s-}$ denotes the jump size at time $s$. However, there are two known
methods to consistently estimate $C_1$.

1) Truncated realized volatility: One chooses a sequence $v_n$ of positive truncation levels,
tending to 0, and considers

(4) $\widehat{C}(v_n)_1 = \sum_{i=1}^n (\Delta^n_i X)^2 \, 1_{\{|\Delta^n_i X| \le v_n\}}.$



2) Multipower variations: One chooses an integer $k \ge 2$, and considers

(5) $\widehat{C}(k, n)_1 = \frac{1}{(m_{2/k})^k} \sum_{i=1}^{n-k+1} \prod_{j=0}^{k-1} |\Delta^n_{i+j} X|^{2/k},$

where $m_p = E(|U|^p)$ is the $p$th absolute moment of a standard normal variable $U$ (other
versions are possible, one may for example take any product of $k$ increments, with powers
adding up to 2).

The first method has been introduced by Mancini in [4], the second one by
Barndorff-Nielsen and Shephard in [1]. Both provide estimators which converge in probability
to $C_1$, upon rather weak assumptions on the jumps, and as soon as
$v_n \asymp 1/n^{\varpi}$ for some $\varpi \in (0, 1/2)$, for the first method.
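
The two estimators just described can be sketched as follows. This is an illustrative plain-Python sketch, not the authors' code: the function names are hypothetical, the multipower normalization by $(m_{2/k})^k$ is the standard one and is assumed here, and $m_p$ is computed from the closed form $m_p = 2^{p/2}\,\Gamma((p+1)/2)/\sqrt{\pi}$.

```python
import math


def abs_moment_std_normal(p):
    """m_p = E|U|^p for a standard normal U: 2^(p/2) * Gamma((p+1)/2) / sqrt(pi)."""
    return 2.0 ** (p / 2.0) * math.gamma((p + 1.0) / 2.0) / math.sqrt(math.pi)


def truncated_realized_volatility(path, v):
    """Sum of the squared increments whose absolute value is at most v."""
    increments = [b - a for a, b in zip(path, path[1:])]
    return sum(d * d for d in increments if abs(d) <= v)


def multipower_variation(path, k):
    """Normalized sum of products of k consecutive |increments|^(2/k)."""
    abs_incs = [abs(b - a) for a, b in zip(path, path[1:])]
    power = 2.0 / k
    norm = abs_moment_std_normal(power) ** k
    total = 0.0
    for i in range(len(abs_incs) - k + 1):
        prod = 1.0
        for j in range(k):
            prod *= abs_incs[i + j] ** power
        total += prod
    return total / norm
```

On a path with one large increment, for example `[0.0, 0.1, 0.2, 5.2, 5.3]` with `v = 1.0`, the truncated sum discards the increment of size 5 and keeps only the three small ones.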

The question of the rate of convergence, though, is still open, and we quickly review the
known results, in the case of truncated realized volatility. One needs the following
assumption, where $r$ is a number in $[0, 2]$:

Assumption (L-r). The variables $\sup_{t \le 1} |b_t|$ and $\sup_{t \le 1} c_t$ and
$\sup_{t \le 1} \int (|x|^r \wedge 1) \, F_t(dx)$ are almost surely finite. □

The bigger $r$ is, the weaker Assumption (L-r) is. (L-2) is a very weak assumption for an
Itô semimartingale, whereas (L-r) when $r < 2$ puts restrictions on the jumps, and is slightly
stronger than saying that the Blumenthal-Getoor index of $X$ (or, jump activity index) is not
bigger than $r$. In particular, (L-1) is slightly stronger than the property of the jumps to be
summable on each finite interval. Note that a stable process of index $\beta \in (0, 2)$
satisfies (L-r) for all $r > \beta$, but not for $r \le \beta$.
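
The stable example can be made concrete. As a sketch, assume a symmetric $\beta$-stable process with Lévy density $|x|^{-1-\beta}$ (unit scale, an assumption made only for this illustration). Then $\int (|x|^r \wedge 1)\,F(dx) = 2\big(\frac{1}{r-\beta} + \frac{1}{\beta}\big)$ when $r > \beta$, and the integral diverges near 0 when $r \le \beta$. The function name below is hypothetical.

```python
import math


def stable_lr_integral(r, beta):
    """Integral of (|x|^r ∧ 1) against the Lévy density |x|^(-1-beta), 0 < beta < 2.

    Closed form under the unit-scale symmetric stable assumption:
    2 * (1/(r - beta) + 1/beta) if r > beta, and +infinity if r <= beta.
    """
    if not 0.0 < beta < 2.0:
        raise ValueError("beta must lie in (0, 2)")
    if r <= beta:
        return math.inf  # the small-jump part of the integral diverges
    return 2.0 * (1.0 / (r - beta) + 1.0 / beta)
```

For a Cauchy process ($\beta = 1$) the value is finite for every $r > 1$ and infinite at $r = 1$.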

When (L-r) holds for some $r < 1$, the estimators $\widehat{C}(v_n)_1$ enjoy exactly the same
CLT as in (3), with the same limit, provided we have

(6) $v_n \asymp 1/n^{\varpi}, \quad \text{with } \frac{1}{4-2r} \le \varpi < \frac{1}{2}.$

When (L-r) holds for some $r \ge 1$, the CLT with rate $\sqrt{n}$ no longer holds for
$\widehat{C}(v_n)$, but we have when $v_n \asymp 1/n^{\varpi}$ with $\varpi \in (0, 1/r)$:

(7) $0 < \varpi < \frac{1}{2} \;\Longrightarrow\; n^{\varpi(2-r)} \, (\widehat{C}(v_n)_1 - C_1) \xrightarrow{\mathbb{P}} 0$

(convergence in probability). These results are shown in [2], and Mancini in [5] has proved
that when the jumps of $X$ are those of a Cauchy process (so (L-r) holds for all $r > 1$, but
not for $r = 1$), the CLT fails.

Turning now to multipowers, we have analogous results, except that one needs stronger
assumptions: basically, (L-r) plus the fact that the process $c_t$ is also an Itô
semimartingale, and never vanishes. The CLT for $\widehat{C}(k, n)_1$ holds when $r < 1$, with
$\sqrt{2}$ replaced by a suitable (bigger) constant depending on $k$, and fails when $r > 1$, as
demonstrated by Vetter in [6], whereas when (L-1) holds, we have a CLT with a possibly
non-vanishing bias term.



3 The results 

We are in a non-parametric setting, in which the process $X$ is not specified (apart from the
fact that it satisfies (L-r) for some $r$), and even the space
$(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, \mathbb{P})$ is not specified. The meaning of
"optimality" or "rate-optimality" is not a priori clear; and, to begin with, even the quantity
to estimate, namely $C_1$, depends of course on the space
$(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, \mathbb{P})$ and on $X$.

A possible setting is as follows. We consider a family $\mathcal{S}$ of Itô semimartingales
satisfying (L-r), each one being defined on its own filtered space
$(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, \mathbb{P})$, and the quantity to estimate is
the associated integrated volatility $C(X)_1$. Each $X$ in $\mathcal{S}$ takes its values, as a
process, in the Skorokhod space $\mathbb{D}^1$ of all càdlàg functions on $\mathbb{R}_+$, and
the image by $X$ of the observed $\sigma$-field $\sigma(X_{i/n} : i = 0, \ldots, n)$ is the
$\sigma$-field $\mathcal{D}_n = \sigma(x(i/n) : i = 0, 1, \ldots, n)$ of $\mathbb{D}^1$. For any
$X \in \mathcal{S}$ we denote by $\mathbb{P}^n_X$ the restriction to $\mathcal{D}_n$ of the law
of $X$.

An estimator at stage $n$ is a $\mathcal{D}_n$-measurable function
$x \mapsto \widehat{C}(x)^n_1$ on $\mathbb{D}^1$. We say that a sequence $\widehat{C}^n_1$ of
such estimators achieves the uniform rate $w_n$ (with $w_n \to \infty$) on $\mathcal{S}$, for
estimating $C_1$, if the family $w_n (\widehat{C}(X)^n_1 - C(X)_1)$ is tight, uniformly in $n$
and in $X \in \mathcal{S}$, i.e.
$|\widehat{C}(X)^n_1 - C(X)_1| = O_P(w_n^{-1})$ uniformly in $X \in \mathcal{S}$.

Of course, if $\mathcal{S}^r$ denotes the set of all Itô semimartingales satisfying (L-r),
there cannot be any uniform rate because, to begin with, the variables $C(X)_1$ are not
uniformly tight when $X$ runs through $\mathcal{S}^r$: we need to restrict our attention to
subfamilies of $\mathcal{S}^r$ which are "bounded" in some sense. In view of the formulation of
(L-r), it is natural to consider, for any $A > 0$, the class:

(8) $\mathcal{S}^r_A$ = the set of all Itô semimartingales with $|b_t| + c_t + \int (|x|^r \wedge 1) \, F_t(dx) \le A$.

We also denote by $\mathcal{S}^{r,L}_A$ the smaller class of Lévy processes belonging to
$\mathcal{S}^r_A$.
The main result of this paper is the following theorem: 



Theorem 3.1. Let $r \in [0, 2)$ and $A > 0$. Any uniform rate $w_n$ for estimating $C(X)_1$,
within the class $\mathcal{S}^{r,L}_A$, hence also within the bigger class $\mathcal{S}^r_A$,
satisfies (up to a multiplicative constant, of course):

(9) $w_n \le \rho_n := \begin{cases} \sqrt{n} & \text{if } r \le 1 \\ (n \log n)^{\frac{2-r}{2}} & \text{if } r > 1. \end{cases}$
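
To get a feel for the bound in Theorem 3.1, the sketch below evaluates $\rho_n$ (taken to be $\sqrt{n}$ for $r \le 1$ and $(n \log n)^{(2-r)/2}$ for $r > 1$) next to the rate $n^{\varpi(2-r)}$ of the truncated estimators. The function names are hypothetical, and the case split for the truncated rate (a $\sqrt{n}$ regime for $r < 1$ with $\frac{1}{4-2r} \le \varpi < \frac12$, and $n^{\varpi(2-r)}$ for $r \ge 1$) is an assumption of this illustration.

```python
import math


def minimax_rate_bound(n, r):
    """rho_n: sqrt(n) if r <= 1, (n log n)^((2-r)/2) if r > 1."""
    if not 0.0 <= r < 2.0:
        raise ValueError("r must lie in [0, 2)")
    if n < 2:
        raise ValueError("n must be at least 2")
    if r <= 1.0:
        return math.sqrt(n)
    return (n * math.log(n)) ** ((2.0 - r) / 2.0)


def truncated_rate(n, r, varpi):
    """Rate assumed for the truncated estimator with v_n of order n^(-varpi)."""
    if not 0.0 < varpi < 0.5:
        raise ValueError("varpi must lie in (0, 1/2)")
    if r < 1.0:
        if varpi < 1.0 / (4.0 - 2.0 * r):
            raise ValueError("for r < 1 this sketch needs varpi >= 1/(4-2r)")
        return math.sqrt(n)
    return n ** (varpi * (2.0 - r))
```

For instance, with `n = 100` and `r = 1.5`, `truncated_rate(100, 1.5, 0.49)` is smaller than `minimax_rate_bound(100, 1.5)`, in line with the truncated estimators being only "almost" rate-optimal when $r > 1$.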



The results recalled in the previous section show that the truncated estimators
$\widehat{C}(v_n)_1$ (which are estimators in the sense specified above) achieve the rate
$\rho_n$ when $r < 1$, and $n^{\varpi(2-r)}$ when $r \ge 1$, for any $X$ satisfying (L-r). We
indeed have (slightly) more:

Theorem 3.2. Let $r \in [0, 2)$ and $A > 0$, and take $v_n \asymp 1/n^{\varpi}$. The truncated
estimators $\widehat{C}(v_n)_1$ have the uniform rate $w_n$ below, within $\mathcal{S}^r_A$, for
estimating $C(X)_1$:

(10) $w_n = \begin{cases} \sqrt{n} & \text{if } r < 1 \text{ and } \frac{1}{4-2r} \le \varpi < \frac{1}{2} \\ n^{\varpi(2-r)} & \text{if } r \ge 1 \text{ and } 0 < \varpi < \frac{1}{2}. \end{cases}$



When $r < 1$, the truncated estimators $\widehat{C}(v_n)_1$ achieve the optimal rate
$\sqrt{n}$, and as seen in the previous section they even enjoy a CLT. When $r \ge 1$ we have
the uniform rate $n^{\varpi(2-r)}$, although for any given $X$ we indeed have a "faster" rate,
as seen in (7); however this faster rate is not uniform in $X \in \mathcal{S}^r_A$, as could be
seen by taking a sequence of Lévy processes with characteristics $(0, 1, G_n)$, with
$\int (|x|^r \wedge 1) \, G_n(dx) \le 1$ (so $X^n \in \mathcal{S}^r_2$ for all $n$), but such
that $\sup_n \int_{\{|x| \le \varepsilon\}} |x|^r \, G_n(dx)$ does not go to 0 as
$\varepsilon \to 0$.

We then conclude that the truncated estimators are uniformly rate optimal when $r < 1$,
and otherwise they approach the bound $\rho_n$, up to $n^{-\varepsilon}$ with $\varepsilon > 0$
arbitrarily small, upon choosing $\varpi$ close enough to $1/2$.

Let us finally state that on the restricted class $\mathcal{S}^{r,L}_A$ the rate $\rho_n$ of
(9) can be achieved exactly and thus constitutes the exact minimax optimal rate: this means
that for any $r \in [0, 2)$ and any $A > 0$ one can find estimators for $C(X)_1$ enjoying the
uniform rate $\rho_n$. When $r < 1$, we already know this (even for the much larger class
$\mathcal{S}^r_A$) by the previous theorem, but for all $r \in [0, 2)$ we can construct
estimators with the uniform rate $\rho_n$ on $\mathcal{S}^{r,L}_A$ as follows. For any process
$X$, we consider the empirical characteristic function of the increments, at each stage $n$
(below $u \in \mathbb{R}$):

(11) $\widehat{\phi}_n(u) = \frac{1}{n} \sum_{j=1}^n e^{iu \Delta^n_j X}.$

Then, we set

(12) $\widehat{C}'(u_n)_1 = -\frac{2n}{u_n^2} \, (\log |\widehat{\phi}_n(u_n)|) \, 1_{\{\widehat{\phi}_n(u_n) \ne 0\}}.$

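Theorem 3.3. For all $A > 0$ and $r \in [0, 2)$ the estimators $\widehat{C}'(u_n)_1$ with

$u_n = \begin{cases} \sqrt{n} & \text{if } r \le 1 \\ \sqrt{(r-1) \, n \log n} \, / \sqrt{A} & \text{if } r > 1 \end{cases}$

attain the uniform rate $\rho_n$ for estimating $C(X)_1$, within the class
$\mathcal{S}^{r,L}_A$ of Lévy processes.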
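
A minimal sketch of the characteristic-function estimator follows. It assumes the estimator has the form $-\frac{2n}{u^2}\log|\widehat{\phi}_n(u)|$ (set to 0 when $\widehat{\phi}_n(u) = 0$) and the frequency $u_n = \sqrt{n}$ for $r \le 1$, $u_n = \sqrt{(r-1)\,n \log n / A}$ for $r > 1$. All function names are hypothetical, and the Brownian simulation helper is only there to exercise the code.

```python
import cmath
import math
import random


def empirical_cf(increments, u):
    """Empirical characteristic function of the increments at frequency u."""
    return sum(cmath.exp(1j * u * d) for d in increments) / len(increments)


def frequency(n, r, A):
    """Assumed choice of u_n: sqrt(n) if r <= 1, sqrt((r-1) n log n / A) otherwise."""
    if r <= 1.0:
        return math.sqrt(n)
    return math.sqrt((r - 1.0) * n * math.log(n) / A)


def cf_volatility_estimator(increments, u):
    """-(2n / u^2) * log|phi_hat(u)|, or 0.0 if phi_hat(u) vanishes."""
    n = len(increments)
    modulus = abs(empirical_cf(increments, u))
    if modulus == 0.0:
        return 0.0
    return -2.0 * n / (u * u) * math.log(modulus)


def simulate_brownian_increments(n, c, seed=0):
    """Illustrative helper: n i.i.d. N(0, c/n) increments of sqrt(c) * W."""
    rng = random.Random(seed)
    sd = math.sqrt(c / n)
    return [rng.gauss(0.0, sd) for _ in range(n)]
```

For pure Brownian increments with variance `c`, `cf_volatility_estimator(incs, frequency(n, r, A))` should be close to `c` for large `n`, since $|\phi_n(u)| = e^{-cu^2/(2n)}$ in that case.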

So far, we do not know whether these estimators achieve the same optimal rate on the
larger class $\mathcal{S}^r_A$.

Remark 3.4. When $r \le 1$ the estimators $\widehat{C}'(u_n)_1$ are likely to enjoy a Central
Limit Theorem with rate $\rho_n$, and with a bias when $r = 1$. But, at least when $r < 1$, we
already have a CLT for the truncated estimators for all semimartingales satisfying (L-r).

When $r > 1$, and upon examining the proof (see (27) and (28) for example), the estimation
error $\widehat{C}'(u_n)_1 - C(X)_1$ is the sum of a random part, which is easily seen to enjoy
a CLT with rate $n^{(2-r)/2} \log n$, and a non-random part equal to
$\Gamma_n = \frac{2}{u_n^2} \int (1 - \cos(u_n x)) \, F(dx)$, where $F$ is the Lévy measure of
the Lévy process $X$ under consideration. It turns out that
$|\rho_n \Gamma_n| \le K \int (u_n^{-r} \wedge |x|^r) \, F(dx)$ goes to 0 by Lebesgue's theorem,
so, for any given $X$ we indeed have $\rho_n (\widehat{C}'(u_n)_1 - C(X)_1) \to 0$ in
probability: this convergence is of course not uniform in $X \in \mathcal{S}^{r,L}_A$, otherwise
the conclusion of Theorem 3.1 would be violated. Now, depending on whether
$\rho_n \Gamma_n (\log n)^{r/2}$ converges or diverges (and both occurrences are possible) we
have a CLT with rate $\rho_n (\log n)^{r/2}$ or we do not have a CLT at all.

4 Proofs 

4.1 Proof of Theorem 3.1

The bound $w_n \le \sqrt{n}$. For proving this bound it is enough to show that it already
holds on the subclass $\mathcal{S}^{BM}_A$ of all Brownian motions with variance $c \le A$ per
unit of time (so $\mathcal{S}^{BM}_A \subset \mathcal{S}^{r,L}_A$ for all $r \in [0, 2]$).

In this case, and as already mentioned in the introduction, we have a parametric model
with parameter $c$ running through $[0, A]$, for which the LAN property holds with rate
$\sqrt{n}$, and the result follows.

The bound $w_n \le (n \log n)^{\frac{2-r}{2}}$ when $r \in (1, 2)$. By scaling, if the result
holds for one $A > 0$, it holds for all $A > 0$. Hence, in order to find a bound on the uniform
rate $w_n$ on $\mathcal{S}^{r,L}_A$, hence a fortiori on $\mathcal{S}^r_A$, it is enough to
construct two sequences $X^n$ and $Y^n$ of Lévy processes belonging to $\mathcal{S}^{r,L}_K$ for
$n \ge 2$ and some constant $K$, with the following two properties, where
$a_n = (n \log n)^{-(2-r)/2}$:

(13) we have $C(X^n)_1 = 1 + a_n$ and $C(Y^n)_1 = 1$ identically;

(14) the total variation distance between $\mathbb{P}^n_{X^n}$ and $\mathbb{P}^n_{Y^n}$ tends to 0.

Indeed, letting $\widehat{C}(X)^n_1$ be a sequence of estimators with uniform rate
$w_n \to \infty$ on $\mathcal{S}^r_A$ (or, even, on $\mathcal{S}^{r,L}_A$), the two sequences
$w_n (\widehat{C}(X^n)^n_1 - (1 + a_n))$ and $w_n (\widehat{C}(Y^n)^n_1 - 1)$ are tight under
$\mathbb{P}^n_{X^n}$ and $\mathbb{P}^n_{Y^n}$ respectively, by (13). Then (14) implies that the
sequence $w_n (\widehat{C}(Y^n)^n_1 - (1 + a_n))$ is also tight under $\mathbb{P}^n_{Y^n}$. This
is possible only if the sequence $w_n a_n$ is bounded. So $1/a_n$ is an upper bound for any
uniform rate on $\mathcal{S}^{r,L}_A$ (up to a multiplicative constant, of course).

The proof of (13) and (14) is divided into several steps.

1) We take Lévy processes $X^n$ and $Y^n$ with respective characteristics
$(0, 1 + a_n, F_n)$ and $(0, 1, G_n)$, with Lévy measures $F_n$, $G_n$ satisfying

(15) $\int (|x|^r \wedge 1) \, F_n(dx) \le K, \quad \int (|x|^r \wedge 1) \, G_n(dx) \le K$

for some constant $K$ (below constants change from line to line, and may depend on $r$, and
are all denoted as $K$).

By construction we have (13) and $X^n, Y^n \in \mathcal{S}^{r,L}_K$ for a constant $K$ (which
may differ from the one in (15)), and we need to choose the above measures $F_n$ and $G_n$ in
such a way that (14) is satisfied.

2) We take $u_n = 2 / a_n^{1/(2-r)} = 2 \sqrt{n \log n}$ and the even functions
$h_n \in C^2(\mathbb{R})$ defined for $u \ge 0$ by

$h_n(u) = a_n \big( 1_{\{u \le u_n\}} + e^{-(u - u_n)^3} \, 1_{\{u > u_n\}} \big).$

We use the following convention for the Fourier transform, namely
$\mathcal{F}g(u) = \int e^{iux} g(x) \, dx$, so the inverse is
$\mathcal{F}^{-1}h(x) = \frac{1}{2\pi} \int e^{-iux} h(u) \, du$.

Since $h_n^{(q)} \in L^p$ for all $p \ge 1$ and $q = 0, 1, 2$, we can define
$H_n = \mathcal{F}^{-1} h_n$, and we have $h_n^{(q)} = i^q \mathcal{F} H_{n,q}$, where
$H_{n,q}(x) = x^q H_n(x)$. By Plancherel identity we deduce

(16) $\|H_n\|_{L^2} \le K a_n u_n^{1/2} \le K a_n^{(3-2r)/(4-2r)}, \quad q = 1, 2 \;\Rightarrow\; \|H_{n,q}\|_{L^2} \le \|h_n^{(q)}\|_{L^2} \le K a_n.$

Then the Cauchy-Schwarz inequality applied to the functions $1/\sqrt{1 + x^2}$ and
$H_n(x) \sqrt{1 + x^2}$ yields

(17) $\int |H_n(x)| \, dx \le K (1 + a_n^{(3-2r)/(4-2r)}) < \infty$

(note that this bound on $\|H_n\|_{L^1}$ is bounded in $n$ when $r \le 3/2$, but not otherwise;
we also have $H_n(0) \ge a_n u_n / \pi \to \infty$). Therefore the two measures

$F_n(dx) = \frac{|H_n(x)|}{x^2} \, dx, \quad G_n(dx) = F_n(dx) + \frac{H_n(x)}{x^2} \, dx$

are non-negative and integrate $x^2$, hence are Lévy measures.

This construction will satisfy (14) mainly because the definition of the two Lévy measures
and the constant value of $h_n$ for $|u| \le u_n$ imply that the difference between the two
characteristic exponents vanishes for $|u| \le u_n$, as we shall prove next.
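
The scalings used in this step can be checked numerically. The sketch below assumes $a_n = (n \log n)^{-(2-r)/2}$, $u_n = 2/a_n^{1/(2-r)}$ and $h_n(u) = a_n$ for $|u| \le u_n$, $h_n(u) = a_n e^{-(|u| - u_n)^3}$ beyond; the Python function names simply mirror these symbols and are otherwise hypothetical.

```python
import math


def a_n(n, r):
    """a_n = (n log n)^(-(2-r)/2), the gap between the two integrated volatilities."""
    return (n * math.log(n)) ** (-(2.0 - r) / 2.0)


def u_n(n, r):
    """u_n = 2 / a_n^(1/(2-r)), which should coincide with 2 * sqrt(n log n)."""
    return 2.0 / a_n(n, r) ** (1.0 / (2.0 - r))


def h_n(u, n, r):
    """Even function equal to a_n on [-u_n, u_n], with a smooth decaying tail beyond."""
    a, cutoff, x = a_n(n, r), u_n(n, r), abs(u)
    if x <= cutoff:
        return a
    return a * math.exp(-((x - cutoff) ** 3))
```

Two identities follow directly from these definitions: `u_n(n, r)` equals `2 * math.sqrt(n * math.log(n))` up to rounding, and `a_n(n, r) * u_n(n, r) ** (2 - r)` equals `2 ** (2 - r)`, which is why quantities of order $a_n u_n^{2-r}$ stay bounded.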

3) Splitting the integration domain into the sets $\{|u| \le u_n\}$ and $\{|u| > u_n\}$ in the
integral $\int e^{-iux} h_n(u) \, du$, we get

$|H_n(x)| \le K a_n \big( u_n \, 1_{\{|x| \le 1/u_n\}} + \frac{1}{|x|} \, 1_{\{1/u_n < |x| \le 1\}} + 1_{\{|x| > 1\}} \big).$

In turn, the integral $\int \frac{|x|^r \wedge 1}{x^2} |H_n(x)| \, dx$ can be split into
integrals on the sets $\{|x| \le 1/u_n\}$, $\{1/u_n < |x| \le 1\}$ and $\{|x| > 1\}$, and
recalling $1 < r < 2$ we deduce from the above that

$\int \frac{|x|^r \wedge 1}{x^2} \, |H_n(x)| \, dx \le K a_n (u_n^{2-r} + 1) \le K.$

It follows that the measures $F_n$ and $G_n$ satisfy (15), and it remains to prove (14).

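4) We denote by $\phi_n$ and $\psi_n$ the characteristic functions of $X^n_{1/n}$ and
$Y^n_{1/n}$, and $\eta_n = \phi_n - \psi_n$. These functions are real (because $H_n$ is an even
function) and are given by

$\phi_n(u) = \exp\big( -\frac{1}{2n} (u^2 + a_n u^2 + 2 \overline{\phi}_n(u)) \big), \quad \psi_n(u) = \exp\big( -\frac{1}{2n} (u^2 + 2 \overline{\phi}_n(u) + 2 \overline{\eta}_n(u)) \big),$

where $\overline{\phi}_n(u) = \int (1 - \cos(ux)) \frac{|H_n(x)|}{x^2} \, dx$ and
$\overline{\eta}_n(u) = \int (1 - \cos(ux)) \frac{H_n(x)}{x^2} \, dx$.

We proceed to studying $\overline{\phi}_n$ and $\overline{\eta}_n$. (16) applied with
$q = 1, 2$ implies that $\overline{\phi}_n$ and $\overline{\eta}_n$ are twice differentiable.
First, we have $\overline{\phi}'_n(u) = \int \sin(ux) \frac{|H_n(x)|}{x} \, dx$, hence (17)
yields

(18) $0 \le \overline{\phi}_n(u) \le K (1 + a_n^{(3-2r)/(4-2r)}) \, u^2, \quad |\overline{\phi}'_n(u)| \le K (1 + a_n^{(3-2r)/(4-2r)}) \, |u|.$

Second, $\overline{\eta}''_n(u) = \int \cos(ux) H_n(x) \, dx = h_n(u)$, whereas
$\overline{\eta}_n(0) = \overline{\eta}'_n(0) = 0$, and this yields

(19) $|u| \le u_n \;\Rightarrow\; \overline{\eta}_n(u) = \frac{a_n u^2}{2}, \;\; \overline{\eta}'_n(u) = a_n u; \qquad |u| > u_n \;\Rightarrow\; |\overline{\eta}_n(u)| \le \frac{a_n u^2}{2}, \;\; |\overline{\eta}'_n(u)| \le a_n |u|.$

5) Since $X^n$ and $Y^n$ have a non-vanishing Gaussian part, the variables $X^n_{1/n}$ and
$Y^n_{1/n}$ have densities, denoted by $f_n$ and $g_n$, and we set $k_n = f_n - g_n$. Since
$X^n$ and $Y^n$ are Lévy processes, the variation distance between $\mathbb{P}^n_{X^n}$ and
$\mathbb{P}^n_{Y^n}$ is not more than $n$ times $\int |k_n(x)| \, dx$, and we are thus left to
show that $n \int |k_n(x)| \, dx \to 0$.

To check this, we use the same argument as for (17): if $k_{n,1}(x) = x k_n(x)$, by the
Cauchy-Schwarz inequality we have
$\int |k_n(x)| \, dx \le K (\|k_n\|_{L^2} + \|k_{n,1}\|_{L^2})$, whereas
$\eta_n = \mathcal{F} k_n$ and also, since $\eta_n$ is twice differentiable,
$\eta'_n = i \mathcal{F} k_{n,1}$. By Plancherel identity, it is thus enough to prove that

(20) $n^2 \int |\eta_n(u)|^2 \, du \to 0, \quad n^2 \int |\eta'_n(u)|^2 \, du \to 0.$

We have $\overline{\phi}_n \ge 0$ and $\overline{\phi}_n + \overline{\eta}_n \ge 0$, which
implies $\phi_n(u) \le e^{-u^2/2n}$ and $\psi_n(u) \le e^{-u^2/2n}$; whereas
$2 \overline{\eta}_n(u) = a_n u^2$ if $|u| \le u_n$ and $|2 \overline{\eta}_n(u)| \le a_n u^2$
if $|u| > u_n$ by (19). Therefore

$|\eta_n(u)| = \phi_n(u) \, \big| 1 - e^{(a_n u^2 - 2 \overline{\eta}_n(u))/(2n)} \big| \le \frac{a_n u^2}{n} \, e^{-u^2/2n} \, 1_{\{|u| > u_n\}},$

and also, upon using (18),

$|\eta'_n(u)| = \frac{1}{n} \big| (u + a_n u + \overline{\phi}'_n(u)) \, \phi_n(u) - (u + \overline{\phi}'_n(u) + \overline{\eta}'_n(u)) \, \psi_n(u) \big|$
$\le \frac{1}{n} \big( a_n |u| \, e^{-u^2/2n} + |\overline{\eta}'_n(u)| \, e^{-u^2/2n} + |u + \overline{\phi}'_n(u)| \, |\eta_n(u)| \big) \, 1_{\{|u| > u_n\}}$
$\le K \frac{a_n |u|}{n} \, e^{-u^2/2n} \Big( 1 + (1 + a_n^{(3-2r)/(4-2r)}) \frac{u^2}{n} \Big) \, 1_{\{|u| > u_n\}}.$

Now, since $u_n = 2 \sqrt{n \log n}$, we have
$\int_{\{|u| > u_n\}} \big( \frac{u^2}{n} \big)^q e^{-u^2/n} \, du \le K \frac{(\log n)^{q - 1/2}}{n^{7/2}}$
for $q = 1, 2, 3$. Since further $a_n^{(3-2r)/(4-2r)} / \sqrt{n} \to 0$, we deduce

$\int |\eta_n(u)|^2 \, du \le K a_n^2 \frac{(\log n)^{3/2}}{n^{7/2}}, \quad \int |\eta'_n(u)|^2 \, du \le K a_n^2 \frac{(\log n)^{5/2}}{n^{7/2}}.$

Then (20) follows, and the proof is complete.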

4.2 Proof of Theorem 3.2

Any $X \in \mathcal{S}^r_A$ can be written as follows, on some space
$(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, \mathbb{P})$:

(21) $X_t = X_0 + \int_0^t b_s \, ds + \int_0^t \sqrt{c_s} \, dW_s + \int_0^t \int_{\mathbb{R}} \delta(s, z) \, 1_{\{|\delta(s,z)| \le 1\}} \, (\mu - \nu)(ds, dz) + \int_0^t \int_{\mathbb{R}} \delta(s, z) \, 1_{\{|\delta(s,z)| > 1\}} \, \mu(ds, dz).$

Here, $b$ and $c$ are as in (L-r), and $W$ is a standard Brownian motion, and $\mu$ is a
Poisson random measure on $\mathbb{R}_+ \times \mathbb{R}$ with intensity measure
$\nu(dt, dz) = dt \otimes dz$, and $\delta = \delta(\omega, t, z)$ is a predictable function on
$\Omega \times \mathbb{R}_+ \times \mathbb{R}$. The connection between $\delta$ and $F_t$ is
that $F_{\omega,t}$ is the image of Lebesgue measure by the map
$z \mapsto \delta(\omega, t, z)$, restricted to $\mathbb{R} \setminus \{0\}$.

We use the decomposition $X = X' + Y + Z$, where

$X'_t = X_0 + \int_0^t b_s \, ds + \int_0^t \sqrt{c_s} \, dW_s$

and $Y$ and $Z$ are, respectively, the last two terms in (21).

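With $w_n$ given by (10), it is clearly enough to prove that, for some constant $K$ only
depending on $A, r, \varpi$ (as will be all constants $K$ below, changing from line to line),
we have

(22) $E(|\widehat{C}(v_n)_1 - C_1|) \le K / w_n.$

The proof requires several steps.

1) With the notation (2), Itô's formula yields $[X', X']^n_1 - C_1 = U_n + V_n$, where

$U_n = \sum_{i=1}^n E(\zeta^n_i \mid \mathcal{F}_{(i-1)/n}), \quad \zeta^n_i = 2 \int_{(i-1)/n}^{i/n} (X'_s - X'_{(i-1)/n}) \, b_s \, ds,$

$V_n = \sum_{i=1}^n \xi^n_i, \quad \xi^n_i = 2 \int_{(i-1)/n}^{i/n} (X'_s - X'_{(i-1)/n}) \sqrt{c_s} \, dW_s + \zeta^n_i - E(\zeta^n_i \mid \mathcal{F}_{(i-1)/n}).$

(25) yields

$|E(\zeta^n_i \mid \mathcal{F}_{(i-1)/n})| \le K / n^{3/2}, \quad E((\zeta^n_i)^2) + E((\xi^n_i)^2) \le K / n^2,$

whereas $E(\xi^n_i \mid \mathcal{F}_{(i-1)/n}) = 0$. Thus we have $E(|U_n|) \le K / \sqrt{n}$ and
$E(V_n^2) \le K / n$, implying

(23) $E(|[X', X']^n_1 - C_1|) \le K / \sqrt{n}.$

Therefore it remains to prove that

(24) $E(|\widehat{C}(v_n)_1 - [X', X']^n_1|) \le K / w_n.$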

2) Here we recall estimates on the increments of $X'$ and $Y$, the latter coming from Lemmas
2.1.5 and 2.1.6 of [3], and where $p > 0$ is arbitrary (the constants $K_p$ below depend on
$p$ in addition to $r, A$). Namely, since $\int_{\{|x| \le 1\}} |x|^r \, F_t(dx) \le A$, we have
uniformly in $s \in [(i-1)/n, i/n]$:

(25) $E(|X'_s - X'_{(i-1)/n}|^p) \le K_p \, n^{-p/2}, \quad E(|Y_s - Y_{(i-1)/n}|^r) \le K n^{-1}.$

We will also use the following estimate, which follows from the property
$F_t(\{x : |x| > 1\}) \le A$ and from the fact that if $\Delta^n_i Z \ne 0$ there is at least
one jump of $Z$ within the interval $((i-1)/n, i/n]$ (this estimate follows from Lemma 2.1.7 of
[3] applied to the counting process $\sum_{s \le t} 1_{\{\Delta Z_s \ne 0\}}$):

(26) $\mathbb{P}(\Delta^n_i Z \ne 0) \le \frac{K}{n}.$



3) Consider the case $r < 1$ first. By Lemma 13.2.6 of [3], applied with $k = 1$ and
$F(x) = x^2$, hence $s' = 2$ and $m = p' = 1$ and $\theta = 0$ (with the notation of this
lemma), we have

$E\Big( \Big| \widehat{C}(v_n)_1 - \sum_{i=1}^n (\Delta^n_i X')^2 \, 1_{\{|\Delta^n_i X'| \le v_n\}} \Big| \Big) \le \frac{K}{n^{\varpi(2-r)}} \le \frac{K}{\sqrt{n}},$

where the last inequality follows from $\varpi \ge \frac{1}{4-2r}$. On the other hand, (25) and
Markov inequality yield
$E((\Delta^n_i X')^2 \, 1_{\{|\Delta^n_i X'| > v_n\}}) \le K_p / n^{1 + p(1 - 2\varpi)/2}$ for
any $p > 0$, and upon taking $p = \frac{1}{1 - 2\varpi}$ we obtain

$E\Big( \Big| [X', X']^n_1 - \sum_{i=1}^n (\Delta^n_i X')^2 \, 1_{\{|\Delta^n_i X'| \le v_n\}} \Big| \Big) \le \frac{K}{\sqrt{n}}.$

These two estimates readily give (24).

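4) Now we turn to the case $r \ge 1$. One has
$\widehat{C}(v_n)_1 - [X', X']^n_1 = \sum_{j=1}^3 U(j)_n$, where
$U(j)_n = \sum_{i=1}^n \eta(j)^n_i$ and

$\eta(1)^n_i = (\Delta^n_i X)^2 \, 1_{\{|\Delta^n_i X| \le v_n\}} - (\Delta^n_i X')^2 - 2 \Delta^n_i X' \, \Delta^n_i Y,$

$\eta(2)^n_i = 2 E(\Delta^n_i X' \, \Delta^n_i Y \mid \mathcal{F}_{(i-1)/n}), \quad \eta(3)^n_i = 2 \Delta^n_i X' \, \Delta^n_i Y - \eta(2)^n_i.$

Itô's formula yields, with the notation $\gamma_s = \int_{\{|x| \le 1\}} x^2 \, F_s(dx)$, and
taking advantage of the facts that $Y$ and $\int_0^t \sqrt{c_s} \, dW_s$ are two orthogonal
martingales and that $Y_t^2 - \int_0^t \gamma_s \, ds$ is a martingale:

$\eta(2)^n_i = 2 E\Big( \int_{(i-1)/n}^{i/n} (Y_s - Y_{(i-1)/n}) \, b_s \, ds \;\Big|\; \mathcal{F}_{(i-1)/n} \Big),$

$E((\Delta^n_i X' \, \Delta^n_i Y)^2 \mid \mathcal{F}_{(i-1)/n}) = E\Big( \int_{(i-1)/n}^{i/n} (Y_s - Y_{(i-1)/n})^2 \, c_s \, ds \;\Big|\; \mathcal{F}_{(i-1)/n} \Big)$
$\quad + 2 E\Big( \int_{(i-1)/n}^{i/n} (X'_s - X'_{(i-1)/n}) (Y_s - Y_{(i-1)/n})^2 \, b_s \, ds \;\Big|\; \mathcal{F}_{(i-1)/n} \Big)$
$\quad + E\Big( \int_{(i-1)/n}^{i/n} (X'_s - X'_{(i-1)/n})^2 \, \gamma_s \, ds \;\Big|\; \mathcal{F}_{(i-1)/n} \Big).$

Then standard estimates and (25), plus Hölder's inequality, yield (the first bound is a.s.):

$|\eta(2)^n_i| \le \frac{K}{n^{3/2}}, \quad E((\eta(3)^n_i)^2) \le \frac{K}{n^2}.$

Since $E(\eta(3)^n_i \mid \mathcal{F}_{(i-1)/n}) = 0$, these estimates yield
$|U(2)_n| \le K / \sqrt{n}$ and $E(U(3)_n^2) \le K / n$, hence it is enough to show that
$E(|U(1)_n|) \le K / w_n$.

5) Recalling $r \ge 1$, the following inequality is easy to check, for
$x, y, z \in \mathbb{R}$ and $v \in (0, 1/4]$:

$\big| (x + y + z)^2 \, 1_{\{|x+y+z| \le v\}} - x^2 - 2xy \big| \le 2 v^2 \, 1_{\{z \ne 0\}} + 6 |xy| \, 1_{\{|x| > v/2\}} + 6 x^2 \, 1_{\{|x| > v/2\}} + 16 v^{2-r} |y|^r.$

It follows that $|\eta(1)^n_i| \le K \sum_{j=1}^4 \xi(j)^n_i$, where

$\xi(1)^n_i = v_n^2 \, 1_{\{\Delta^n_i Z \ne 0\}}, \quad \xi(2)^n_i = |\Delta^n_i X' \, \Delta^n_i Y| \, 1_{\{|\Delta^n_i X'| > v_n/2\}},$

$\xi(3)^n_i = (\Delta^n_i X')^2 \, 1_{\{|\Delta^n_i X'| > v_n/2\}}, \quad \xi(4)^n_i = v_n^{2-r} \, |\Delta^n_i Y|^r.$

(26) yields $E(\xi(1)^n_i) \le K / n^{1 + 2\varpi}$, and (25) yields
$E(\xi(4)^n_i) \le K / n^{1 + \varpi(2-r)}$. Another application of (25), plus Hölder and Markov
inequalities, gives us $E(\xi(j)^n_i) \le K_p / n^{1 + p(1 - 2\varpi)/2}$ for $j = 2, 3$. Upon
taking $p$ large enough, we obtain

$E(\xi(j)^n_i) \le \frac{K}{n \, w_n}$

for $j = 1, 2, 3, 4$. We deduce $E(|U(1)_n|) \le K / w_n$, and the proof is complete.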
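
The elementary inequality used in the last step can be spot-checked numerically. The sketch below assumes it reads $|(x+y+z)^2 1_{\{|x+y+z| \le v\}} - x^2 - 2xy| \le 2v^2 1_{\{z \ne 0\}} + 6|xy| 1_{\{|x| > v/2\}} + 6x^2 1_{\{|x| > v/2\}} + 16 v^{2-r} |y|^r$ for $r \in [1, 2]$ and $v \in (0, 1/4]$. The function names and the sampling scheme are hypothetical, and a random search is of course not a proof.

```python
import random


def truncation_error(x, y, z, v):
    """Left-hand side: |(x+y+z)^2 1{|x+y+z| <= v} - x^2 - 2xy|."""
    s = x + y + z
    kept = s * s if abs(s) <= v else 0.0
    return abs(kept - x * x - 2.0 * x * y)


def truncation_bound(x, y, z, v, r):
    """Right-hand side of the assumed inequality."""
    jump = 1.0 if z != 0.0 else 0.0
    large_x = 1.0 if abs(x) > v / 2.0 else 0.0
    return (2.0 * v * v * jump + 6.0 * abs(x * y) * large_x
            + 6.0 * x * x * large_x + 16.0 * v ** (2.0 - r) * abs(y) ** r)


def count_violations(trials, seed=0):
    """Count random draws (r in [1,2], v in (0,1/4]) where the inequality fails."""
    rng = random.Random(seed)
    violations = 0
    for _ in range(trials):
        r = rng.uniform(1.0, 2.0)
        v = rng.uniform(1e-3, 0.25)
        scale = rng.choice([0.1 * v, v, 10.0 * v])  # x, y below, near and above v
        x = rng.uniform(-scale, scale)
        y = rng.uniform(-scale, scale)
        z = rng.choice([0.0, rng.uniform(-2.0, 2.0)])  # no large jump, or one
        if truncation_error(x, y, z, v) > truncation_bound(x, y, z, v, r) + 1e-12:
            violations += 1
    return violations
```

Calling `count_violations(20000)` draws the three components on scales below, near and above the truncation level, with and without a large jump `z`.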

4.3 Proof of Theorem 3.3

We let $X \in \mathcal{S}^{r,L}_A$, where $r \in [0, 2)$ and $A > 0$ are given. The
characteristic triple of $X$ is $(b, c, F)$ and the characteristic function of $X_{1/n}$ is

$\phi_n(u) = \exp\Big( \frac{1}{n} \Big( iub - \frac{c u^2}{2} + \int (e^{iux} - 1 - iux \, 1_{\{|x| \le 1\}}) \, F(dx) \Big) \Big).$

Then $|\phi_n(u_n)| = e^{-\frac{1}{2n} (c u_n^2 + \gamma_n)}$, where
$\gamma_n = 2 \int (1 - \cos(u_n x)) \, F(dx)$. The estimation error
$\widehat{C}'(u_n)_1 - c$ is the sum $G_n + H_n$ of the deterministic and stochastic errors:

$G_n = -\frac{2n}{u_n^2} \log |\phi_n(u_n)| - c = \frac{\gamma_n}{u_n^2}, \quad H_n = \frac{2n}{u_n^2} \Big( \log |\phi_n(u_n)| - (\log |\widehat{\phi}_n(u_n)|) \, 1_{\{\widehat{\phi}_n(u_n) \ne 0\}} \Big).$

We study these two errors separately. We have
$1 - \cos y \le 2 (1 \wedge y^2) \le 2 (|y|^r \wedge 1)$. Since by hypothesis
$c + \int (|x|^r \wedge 1) \, F(dx) \le A$, it follows that

$0 \le \gamma_n \le 4 \int (|u_n x|^r \wedge 1) \, F(dx) \le 4 A u_n^r.$

This implies, with $\Gamma = \exp \max_{n \ge 1} \frac{2 A u_n^r}{n} < \infty$ when $r > 1$ and
$\Gamma = \exp(\frac{5}{2} A)$ if $r \le 1$:

(27) $|G_n| \le \frac{4A}{u_n^{2-r}}, \quad \frac{1}{|\phi_n(u_n)|} = e^{c u_n^2 / 2n} \, e^{\gamma_n / 2n} \le \Gamma \, n^{(r-1)^+/2}.$

Second, we study $H_n$. The variables $\exp(i u_n \Delta^n_j X)$ are i.i.d. as $j$ varies, with
modulus 1 and expectation $\phi_n(u_n)$, hence
$V_n = \widehat{\phi}_n(u_n) - \phi_n(u_n)$ satisfies $E(|V_n|^2) \le 1/n$. Thus, on the set
$\{|V_n| \le 1/n^{r/4}\}$, we have $|V_n / \phi_n(u_n)| \le 1/2$ and
$\widehat{\phi}_n(u_n) = V_n + \phi_n(u_n) \ne 0$ as soon as
$n \ge n_0 = (2\Gamma)^{4/((2-r) \wedge r)}$, in which case we deduce, for some universal
constant $K$:

$|H_n| = \frac{2n}{u_n^2} \Big| \log \Big| 1 + \frac{V_n}{\phi_n(u_n)} \Big| \Big| \le K \frac{n \, |V_n|}{u_n^2 \, |\phi_n(u_n)|}.$

Henceforth, if $n \ge n_0$,

(28) $E\big( |H_n| \, 1_{\{|V_n| \le 1/n^{r/4}\}} \big) \le \begin{cases} \frac{K \Gamma}{\sqrt{n}} & \text{if } r \le 1 \\ \frac{K A \Gamma}{(r-1) \, n^{(2-r)/2} \log n} & \text{if } r > 1. \end{cases}$

Putting together (27) and (28), plus the fact that
$\mathbb{P}(|V_n| > 1/n^{r/4}) \le 1/n^{(2-r)/2}$ (by Bienaymé-Tchebycheff inequality), and the
equality $\widehat{C}'(u_n)_1 - c = G_n + H_n$, we deduce that
$\rho_n (\widehat{C}'(u_n)_1 - c)$ (with the notation (9)) is tight, uniformly in
$X \in \mathcal{S}^{r,L}_A$.